Serveur d'exploration autour du libre accès en Belgique

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Open data and open code for big science of science studies

Identifieur interne : 000278 ( Main/Exploration ); précédent : 000277; suivant : 000279

Open data and open code for big science of science studies

Auteurs : Robert P. Light [États-Unis] ; David E. Polley [États-Unis] ; Katy Börner [États-Unis]

Source :

RBID : Pascal:14-0270434

Descripteurs français

English descriptors

Abstract

Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">14-0270434</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0270434 INIST</idno>
<idno type="RBID">Pascal:14-0270434</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000004</idno>
<idno type="stanalyst">FRANCIS 14-0270434 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000013</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000131</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
<idno type="wicri:doubleKey">0138-9130:2014:Light R:open:data:and</idno>
<idno type="wicri:Area/Main/Merge">000278</idno>
<idno type="wicri:Area/Main/Curation">000278</idno>
<idno type="wicri:Area/Main/Exploration">000278</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Data analysis</term>
<term>Database</term>
<term>Growth</term>
<term>Interdisciplinary field</term>
<term>Patents</term>
<term>Scientific research</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse donnée</term>
<term>Interdisciplinaire</term>
<term>Base de données</term>
<term>Brevet</term>
<term>Algorithme</term>
<term>Croissance</term>
<term>Recherche scientifique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
<term>Brevet</term>
<term>Recherche scientifique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Indiana</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Indiana">
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
</region>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Belgique/explor/OpenAccessBelV2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000278 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000278 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Belgique
   |area=    OpenAccessBelV2
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:14-0270434
   |texte=   Open data and open code for big science of science studies
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Dec 1 00:43:49 2016. Site generation: Wed Mar 6 14:51:30 2024